Statistical estimation of unreliable features for robust speech recognition
نویسندگان
چکیده
This paper addresses the problem of robust speech recognition in noisy conditions in the framework of hidden Markov models (HMMs) and missing feature techniques. It presents a new statistical approach to detection and estimation of unreliable features based on a probabilistic measure and Gaussian mixture model (GMM). In the estimation process, the GMM is compensated using parameters of the statistical model of additive background noise. The GMM means are used to replace the unreliable features. The GMM based technique is less complex than the corresponding HMM based estimation and gives similar improvement in the recognition performance. Once unreliable features are replaced by the estimated clean speech features, the entire set of spectral features can be transformed to the other feature domain characterized by higher baseline recognition rate (e.g MFCCs) for final recognition using continuous density hidden Markov models (CDHMMs) with diagonal covariance matrices.
منابع مشابه
روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملHMM-based estimation of unreliable spectral components for noise robust speech recognition
This paper presents a novel approach for reconstructing unreliable spectral components, which utilizes HMM-based missing feature algorithms, and applies them to noise robust speech recognition. The proposed technique uses the forwardbackward algorithm to estimate corrupt spectrographic data based on nearby reliable features, noisy observations, and on an underlying statistical model. The estima...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملDetection of Reliable Features for Speech Recognition in Noisy Conditions Using a Statistical Criterion
This paper addresses the problem of integration of missing data theory in the context of robust speech recognition in additive noise. It shows that techniques based on statistical estimation and thresholding of a posteriori signal-to-noise ratio (SNR) can be used for the detection of reliable (not much affected by noise) features as opposed to unreliable or missing (masked by noise) features. I...
متن کاملRobust speech recognition using missing feature theory in the cepstral or LDA domain
When applying Missing Feature Theory to noise robust speech recognition, spectral features are labeled as either reliable or unreliable in the time-frequency plane. The acoustic model evaluation of the unreliable features is modified to express that their clean values are unknown or confined within bounds. Classically, MFT requires an assumption of statistical independence in the spectral domai...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000